m 

SPLICE FORM 1: 

1 CGTCCTTCCT GGTCCTGCGG 

51 GGGCCGTGCC CGGTGCCAGC 

101 TGGTCACAGT GAGCCGTTCG 

151 GGGCCCTGGG ACCAGGCGGT 

201 GAGCTTTGCG GTGCTCCGTG 

251 ACAATGATGA TGCAGCAGAG 

301 AGTGAGGAGG AGCTCCACGG 

351 GAGTCCCCAG AAGCAGGAGG 

401 AGCTGCTGAG GCCGCAGGAT 

4 51 CCCCGGCCTC CCCGGCTCCG 

501 AGAAGGTCTG AGCCAGGATG 

551 ACAGCAGCTC CCCCAGCTGC 

601 GACACCCAGG TGTACTTAGA 

651 TGGGCAAAGC CGGATCTTCA 

701 CACTCCAGGT ATTGCACCAA 

751 GTACCGGGTG GCAGTGCCCT 

801 GAACTCCGAA CAGAGCTGCC 

851 AGTCTCTGCG GCCTCCCAGC 

901 CAGATGGAGC AGGCGATCCG 

951 TGACCTGGAG AGTGTCACTT 

1001 GCCTGGGGCT CCCCCTCCAG 

1051 CTGCTGCTGG TGGCACAGCG 

1101 CTACCTGGGC TCAGAGTGGA 

1151 ACAGGGTCAC CCACATCTTG 

1201 CCTGAGCGCT TCACCTACCA 

1251 CCAGCTGCTG CCGCACTGGA 

1301 GAGCACAGGG CACCCACGTG 

1351 TCAGCGGCCA CAGTGCTGGC 

1401 GGAGCAGGCC CTGCGCCACG 

1451 ACCCTGGCTT CCTGCGCCAG 

1501 AGAACCTGAG GGTGGTGGGG 

1551 AGGCAGCCCC GAAAGAAGAG 

1601 GGGGTCATGA GGTCCATCAG 

1651 CACCTCAGAG ACCAGTGACA 

1701 CACATGAAGA GCCTCTGCAG 

17 51 GGCCAGCAGG TGGACAGGGG 

1801 AGTGGTTACC CTCCAGGGCA 

1851 TCCAGGAGCA GGAGCAGGGG 

1901 TCCTCTACGC CCAGGTTCCG 

1951 CAGTGGAGAG GAGGGCGAGG 

2001 TGACACTGAA GAGGATCCAC 

2051 TGCCGCACAC ATTCCTCTCA 

2101 ACCTCCCACC CCTGTCACTA 

2151 CCTCACCTCC TACAGCCTTA 

2201 GCTCAAGACT TTCTAACTGG 

2251 GGGGCAACAG CACCCTAGTT 

2301 CTGTGGCACG GAATGAAAAC 

2351 CCCACCCCCG CCCCCTCCCT 

2401 GGAACCAGCC AGGCCAGGCA 

2451 TCAGGCCCCA GCCGCGGGAG 

2501 TCCACCTCCA CCGGTCCAGG 

2551 CACCACGCCA GATCACAGGG 

2601 CTTTCTGGCC TCTGGCTAGT 

2651 TTGTACTGAG AAATAAAACA 

2701 AAAA 



GTCCAGGACT GTCCGCGGGG TTGAGGGAAG 
CCAGGTGCTC GCGGCCTGGC TCCATGGCCC 
CCCCCGGGCA GCGGCGCCTC CACGCCCGTG 
CCAGCGAAGG AGTCGACTCC AGCGAAGGCA 
GGGCTGTCCT GGGACTGCAG GATGGAGGGG 
GCCAGTTCTG AGCCAACAGA GAAGGCCCCG 
GGACCAGACA GACTTCGGGC AAGGATCCCA 
AGCAGAGGCA GCACCTGCAC CTCATGGTAC 
GACATCCGCC TGGCAGCCCA GCTGGAGGCA 
CTACCTGCTG GTAGTTTCTA CACGAGAAGG 
AGACGGTCCT CCTGGGCGTG GATTTCCCTG 
ACCCTGGGCC TGGTCTTGCC CCTCTGGAGT 
TGGAGACGGG GGCTTCAGCG TGACGTCTGG 
AGCCCATCTC CATCCAGACC ATGTGGGCCA 
GCATGTGAGG CAGCTCTAGG CAGCGGCCTT 
CACCTGGGCC AGCCACTACC AGGAGAGACT 
TCAATGAGTG GACGGCTATG GCCGACCTGG 
GCCGAGCCTG GCGGGTCCTC AGAACAGGAG 
TGCTGAGCTG TGGAAAGTGT TGGATGTCAG 
CCAAAGAGAT CCGCCAGGCT CTGGAGCTGC 
CAGTACCGTG ACTTCATCGA CAACCAGATG 
GGACCGAGCC TCCCGCATCT TCCCCCACCT 
ACGCAGC7\AA CCTGGAGGAG CTGCAGAGGA 
AACATGGCCC GGGAGATTGA CAACTTCTAC 
CAATGTGCGC CTCTGGGATG AGGAGTCGGC 
AGGAGACGCA CCGCTTCATT GAGGCTGCAA 
CTGGTCCACT GCAAGATGGG CGTCAGCCGC 
CTATGCCATG AAGCAGTACG AATGCAGCCT 
TGCAGGAGCT CCGGCCCATC GCCCGCCCCA 
CTGCAGATCT ACCAGGGCAT CCTGACGGCC 
AGGAGAAGGT TGTAGGCATG GAAGAGAGCC 
CCTGGGCCAC GGCCACGTAT AAACCTCCGA 
TCTTCTGGAG CCCTCCTTGG AGCTGGAGAG 
TGCCAGAGGT CTTCTCTTCC CACGAGTCTT 
CCCTTCCCAC AGCTTGCAAG GACCAAGGGA 
GCCTCAGCCT GCCCTGAAGT CCCGCCAGTC 
GTGCCGTGGT GGCCAACCGG ACCCAGGCCT 
CAGGGGCAGG GGCAGGGAGA GCCCTGCATT 
GAAGGTGGTG AGACAGGCCA GCGTGCATGA 
CCTGAGCGCT CACACATGCC CACGCTCCCC 
AACTCCTTGG AGAAACACCC TCACGTCTGT 
GCTCCGCCCC ATACCCGTCA CTACAGCCTC 
CGGCCTCACC TCCCACCCCT GTCACTACAG 
AGTCCCAGGC CCATGTCTGC CTGTCCAAGG 
GATGTGGTAG AGGGACTGAA GGTACCTTTG 
TCATTCTCAA CTCTAGCCCT GCACACTCAC 
AGAGCTTCCC GTGCAAAAAG GGTCACGCCT 
GCACCTCCTG TCCTCTCCCA GTTCATTCCT 
ACCAGTGGCC CCCAAAGGCA GGCAGGATCC 
GCTGGAAGGG CTGGCAGATC GCTTCCCTCA 
TCTTTGCTGC TGTCCCCAGA CCTCCTGTGA 
CACCAGGCCA GAGATAGTCT TCTTTTTGTC 
CAGTTTTTCA TAGCCTTACA GTATCTGGCT 
CATTTTCATA AAAAAAAAAA A7VAAAAAAAA 



FEATURES : 

5' UTR: 1-93 
Start: 94-1506 
Stop: 1509 
3' UTR: 1510-2704 
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-"^^ ^.ccooccoT oc„ -™c 

51 GGCTCCATGG CCCTGGTCAC ^GTCAGCCGT ^^^^^^ agGAGTCGAC 

101 CTCCACGCCC GTGGGGCCCT GGGACCAGGC ^ ^^^^^ CCTGGGACTG 

151 TCCAGCGAAG GCAGAGCTTT GCGGTGCTCC G ^^^^^ CTGAGCCAAC 

201 CAGGATGGAG GGGACAATGA ^GATGCAGCA ^^^^^^ ACAGACTTCG 

251 AGAGAAGGCC CCGAGTGAGG AGGAGCTCCA ^^^^^ GCAGCACCTG 

301 GGCAAGGATC CCAGAGTCCC CAGAAGCAGG J ^^^^^^ GCCTGGCAGC 

351 CACCTCATGG TACAGCTGCT GAGGCCGCAG G ^^^^^^^ cTGGTAGTTT 

111 CCAGCTGGAG GCACCCCGGC CTCCCCGGCT CCTCCTGGGC 

451 CTACACGAGA AGGAGAAGGT CTGAGCCAGG ^^^^^^ GCCTGGTCTT 

501 GTGGATTTCC CTGACAGCAG CTCCCCCAGC ^^^^^^ gGGGGCTTCA 

551 GCCCCTCTGG AGTGACACCC ^GGTGTACTT ^^^^^^ CTCCATCCAG 

601 GCGTGACGTC TGGTGGGCAA ^GCCGGATCT ^^^^^^ aGGCAGCTCT 
arCATGTGGG CCACACTCCA GGTATTGCAt^ rrTCACCTGG GCCAGCCACT 

01 agSgcggc cttgtaccgg gtggcagtg^ SSSga gtggacggct 
751 accaggagag actgaactcc gaacagagct g ^^^g^ ctggcgggtc 
801 atggccgacc tggagtctct gcggcctccc ^ ctgtggaaag 

851 CTCAGAACAG GAGCAGATGG ^GCAGGCGAT CCG ^^tCCGCCAG 

901 TGTTGGATGT CAGTGACCTG GAGAGTGTCA C ^^^^ GTGACTTCAT 

951 GCTCTGGAGC TGCGCCTGGG GCTCCCCCTC ^ ^^^^^^ gCCTCCCGCA 

1001 CGACAACCAG ATGCTGCTGC TGGTGGCACA ^^^^^^ AAACCTGGAG 

1051 TCTTCCCCCA CCTCTACCTG GGCTCAGAGT ^^^^^ CCCGGGAGAT 
fioi GAGCTGCAGA GGAACAGGGT CACCCACATC ^ ^^^^^.^ CGCCTCTGGG 

1151 TGACAACTTC TACCCTGAGC GCTTCACCl ^^^^^^ GCACCGCTTC 

1201 ATGAGGAGTC GGCCCAGCTG CTGCCGCACT ^^^^^ aCTGCAAGAT 

1251 ATTGAGGCTG CAAGAGCACA GGGCACCCAC ^ atGAAGCAGT 

1301 GGGCGTCAGC CGCTCAGCGG CCACAGTGCT ^G ^^^^^ gCTCCGGCCC 

1351 ACGAATGCAG CCTGGAGCAG GCCCTGCGCC ^ tCTACCAGGG 

1401 ATCGCCCGCC CCAACCCTGG CTTCCTGCGC AAAGTGGGTG 

1451 CATCCTGACG GCCAGCCGCC JGAGCCATGT acCATTCCCA 

1501 GGGTCTCCCC AGAGGAGCAC CCAGCCCCTG ^^^^^^ TTGTAGGCAT 

1551 CTTCTTCCGC CAGAACCTGA ^GGTGGTGG ^^^^^^ CGGCCACGTA 

ieOl GGAAGAGAGC CAGGCAGCCC CGAAAGAAGA ^^^^^ gCCCTCCTTG 

1551 TAAACCTCCG AGGGGTCATG ^GGTCCATCA ^^^^^ tCTTCTCTTC 

1701 GAGCTGGAGA GCACCTCAGA ^ACCAGTGAC ^ ^^^^^ cAGCTTGCAA 

^n'C^ CCACGAGTCT TCACATGAAG AGCCTCTbCA ^ ^ TGCCCTGAAG 
iloi GGACCAAGGG AGGCCAGCAG GTG^^« SS.GG TGGCCAACCG 

1851 TCCCGCCAGT CAGTGGTTAC CCTCCAGGGC ^^^^^ gGGCAGGGAG 

1901 GACCCAGGCC TTCCAGGAGC AGGAGCAGGG ^ gAGACAGGCC 

I951 AGCCCTGCAT ttcctctacg cccaggttcc gg tcacacatgc 
2001 agcgtgcatg acagtggaga ggagggcgag gcctg^^^^^ g^g^cacc 

2051 CCACGCTCCC CTGACACTGA ^^AGGATCCA ^^^^^ CATACCCGTC 

2101 CTCACGTCTG TTGCCGCACA GMTCCTCTC A ^^^^^ cTCCCACCCC 

2151 ACTACAGCCT CACCTCCCAC CCCTGTCACT ^ cCCATGTCTG 

2201 TGTCACTACA GCCTCACCTC CTACAGCCTT ^^^^^^^^^ gAGGGACTGA 

2251 CCTGTCCAAG GGCTCAAGAC TTTCTAACTG ^ acTCTAGCCC 

2301 aggtaccttt gggggcaaca gcaccctagt tt ^^^cc cgtgcaaaaa 

2351 TGCACACTCA CCTGTGGCAC ^GAATGAAAA G ^^^^^ GTCCTCTCCC 

2401 gggtcacgcc tcccaccccc ^ccccctccc ^^^^^gc CCCCAAAGGC 

2451 AGTTCATTCC TGGAACCAGC CAGGCCAGGC ^ gCTGGCAGAT 

2501 AGGCAGGATC CTCAGGCCCC AGCCGCGGGA GGC cTGTCCCCAG 

iVc^ rrr-TTCCCTC ATCCACCTCC ACCGGTCLAt, „r AGAGATAGTC 

StcSSg acaccacgcc agatcacagg caccaggcc a^ 

ofiM TTCTTTTTGT CCTTTCTGGC CTCTGGCTA^ icATTTTCAT AAAAAAAAAA 
'2 0 AgStCTGGC TTTGTACTGA GAAATA^ SSaAA AAAAAAAAAA 

2851 AA 
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FEATURES : 

5' UTR: 1-56 
Start: 57 
Stop: 2034 
3' UTR: 2037-2852 



SPLICE FORM 3: 

1 CCTGGTCCTG CGGGTCCAGG 

51 TGCCCGGTGC CAGCCCAGGT 

101 CAGTGAGCCG TTCGCCCCCG 

151 TGGGACCAGG CGGTCCAGCG 

201 TGCGGTGCTC CGTGGGGCTG 

251 ATGATGCAGC AGAGGCCAGT 

301 GAGGAGCTCC ACGGGGACCA 

351 CCAGAAGCAG GAGGAGCAGA 

401 TGAGGCCGCA GGATGACATC 

451 CCTCCCCGGC TCCGCTACCT 

501 TCTGAGCCAG GATGAGACGG 

551 GCTCCCCCAG CTGCACCCTG 

601 CAGGTGTACT TAGATGGAGA 

651 AAGCCGGATC TTCAAGCCCA 

701 AGGAGCAGAT GGAGCAGGCG 

751 GTCAGTGACC TGGAGAGTGT 

801 GCTGCGCCTG GGGCTCCCCC 

851 AGATGCTGCT GCTGGTGGCA 

901 CACCTCTACC TGGGCTCAGA 

951 GAGGAACAGG GTCACCCACA 

1001 TCTACCCTGA GCGCTTCACC 

1051 TCGGCCCAGC TGCTGCCGCA 

1101 TGCAAGAGCA CAGGGCACCC 

1151 GCCGCTCAGC GGCCACAGTG 

1201 AGCCTGGAGC AGGCCCTGCG 

1251 CCCCAACCCT GGCTTCCTGC 

1301 CGGCCAGAAC CTGAGGGTGG 

1351 GAGCCAGGCA GCCCCGAAAG 

14 01 CCTCCGAGGG GTCATGAGGT 

1451 CTGGAGAGCA CCTCAGTAGA 

1501 ACGAGTCTTC ACATGAAGAG 

1551 ACCAAGGGAG GCCAGCAGGT 

1601 CCGCCAGTCA GTGGTTACCC 

1651 CCCAGGCCTT CCAGGAGCAG 

17 01 CCCTGCATTT CCTCTACGCC 

1751 CGTGCATGAC AGTGGAGAGG 

1801 ACGCTCCCCT GACACTGAAG 

1851 CACGTCTGTT GCCGCACACA 

1901 TACAGCCTCA CCTCCCACCC 

1951 TCACTACAGC CTCACCTCCT 

2001 TGTCCAAGGG CTCAAGACTT 

2051 GTACCTTTGG GGGCAACAGC 

2101 CACACTCACC TGTGGCACGG 

2151 GTCACGCCTC CCACCCCCGC 

2201 TTCATTCCTG GAACCAGCCA 

2251 GCAGGATCCT CAGGCCCCAG 

2301 CTTCCCTCAT CCACCTCCAC 

2351 CTCCTGTGAC ACCACGCCAG 

2401 CTTTTTGTCC TTTCTGGCCT 

24 51 TATCTGGCTT TGTACTGAGA 

2501 AAAAAAAAAA AAAAA7\AAAA 



ACTGTCCCGC GGGGTTGAGG GAAGGGGCCG 
GCTCGCGGCC TGGCTCCATG GCCCTGGTCA 
GGCAGCGGCG CCTCCACGCC CGTGGGGCCC 
AAGGAGTCGA CTCCAGCGAA GGCAGAGCTT 
TCCTGGGACT GCAGGATGGA GGGGACAATG 
TCTGAGCCAA CAGAGAAGGC CCCGAGTGAG 
GACAGACTTC GGGCAAGGAT CCCAGAGTCC 
GGCAGCACCT GCACCTCATG GTACAGCTGC 
CGCCTGGCAG CCCAGCTGGA GGCACCCCGG 
GCTGGTAGTT TCTACACGAG AAGGAGAAGG 
TCCTCCTGGG CGTGGATTTC CCTGACAGCA 
GGCCTGGTCT TGCCCCTCTG GAGTGACACC 
CGGGGGCTTC AGCGTGACGT CTGGTGGGCA 
TCTCCATCCA GACCATGTGG TCCTCAGAAC 
ATCCGTGCTG AGCTGTGGAA AGTGTTGGAT 
CACTTCCAAA GAGATCCGCC AGGCTCTGGA 
TCCAGCAGTA CCGTGACTTC ATCGACAACC 
CAGCGGGACC GAGCCTCCCG CATCTTCCCC 
GTGGAACGCA GCAAACCTGG AGGAGCTGCA 
TCTTGAACAT GGCCCGGGAG ATTGACAACT 
TACCACAATG TGCGCCTCTG GGATGAGGAG 
CTGGAAGGAG ACGCACCGCT TCATTGAGGC 
ACGTGCTGGT CCACTGCAAG ATGGGCGTCA 
CTGGCCTATG CCATGAAGCA GTACGAATGC 
CCACGTGCAG GAGCTCCGGC CCATCGCCCG 
GCCAGCTGCA GATCTACCAG GGCATCCTGA 
TGGGGAGGAG AAGGTTGTAG GCATGGAAGA 
AAGAGCCTGG GGCCACGGGG CACGTATAAA 
CCATCAGTCT TCTGGAGCCC TCCTTGGGAG 
CCAGTGACAT GCCAGAGGTC TTCTCTTCCC 
CCTCTGCAGC CCTTCCCACA GCTTGCAAGG 
GGACAGGGGG CCTCAGCCTG CCCTGAAGTC 
TCCAGGGCAG TGCCGTGGTG GCCAACCGGA 
GAGCAGGGGC AGGGGCAGGG GCAGGGAGAG 
CAGGTTCCGG AAGGTGGTGA GACAGGCCAG 
AGGGCGAGGC CTGAGCCCTC ACACATGCCC 
AGGATCCACA ACTCCTTGGA GAAACACCCT 
TTCCTCTCAG CTCCGCCCCA TACCCGTCAC 
CTGTCACTAC GGCCTCACCT CCCACCCCTG 
ACAGCCTTAA GTCCCAGGCC CATGTCTGCC 
TCTAACTGGG ATGTGGTAGA GGGACTGAAG 
ACCCTAGTTT CATTCTCAAC TCTAGCCCTG 
AATGAAAACA GAGCTTCCCG TGCAAAAAGG 
CCCCTCCCTG CACCTCCTGT CCTCTCCCAG 
GGCCAGGCAA CCAGTGGCCC CCAAAGGCAG 
CCGCGGGAGG CTGGAAGGGC TGGCAGATCG 
CGGTCCAGGT CTTTGCTGCT GTCCCCAGAC 
ATCACAGGGC ACCAGGCCAG AGATAGTCTT 
CTGGCTAGTC AGTTTTTCAT AGCCTTACAG 
AATAAAACAC ATTTTCATAT TTGGTTAAAA 
AAAAAAAAAA AAAAAAAAAA 
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FEATURES : 

5' UTR: 1-88 
Start: 88 
Stop: 1311 
3' UTR: 1315-2540 



Homologous proteins: 

Top 10 BLAST Hits 

SPLICE FORM 1: 

gi!8923483iref |NP_060327.1| hypothetical protein FLJ20515 >gi I 7 . 
gii6714641|dbj IBAA89534.il (AB036834) MAP kinase phosphatase [D. 
gil8922777!ref |NP_060746.1| hypothetical protein FLJ10928 >gi I 7 . 
gi 1 7242951 Idbj IBAA92536.il (AB037719) KIAA1298 protein [Homo sa. 
gi|73012421gb|AAF56372.1| (AE003750) CG6238 gene product [Droso. 
gi I 4150963 |emb|CAA77232. II (Y18620) DsPTPl protein [Arabidopsis . 
gi|9294518|dbj IBAB02780.il (AB023036) dual-specificity protein . 
gi|6862915!gblAAF30304.1|AC018907_4 (AC018907) putative dual-sp. 
gi|6015037|sp|O54838|DUS5_RAT DUAL SPECIFICITY PROTEIN PHOSPHAT. 
gi|9910432|ref |NP_064570.H mitogen-activated protein kinase ph. 



Score 

576 
337 
233 
143 
124 
113 
113 
94 
92 
90 



e-163 
2e-91 
3e-60 
4e-33 
2e-27 
5e-24 
5e-24 
2e-18 
9e-18 
3e-17 



SPLICE FORM 2: 

gi|8923483|ref 
gi I 6714641 !dbj 
gi|8922777!ref 
gi|7242951idbj 
gi I 9294518 Idbj 
gi|4150963|enib 
gi I 4758212 1 ref 
gi|6679156|ref 
gi| 6862915|gb| 
gi|6015037|sp| 



|NP_060327.1| hypothetical protein FLJ20515 [Homo. 
IBAA89534.il {AB036834) MAP kinase phosphatase [D. 
1NP_060746.1| hypothetical protein FLJ10928 [Homo. 
IBAA92536.il {AB037719) KIAA1298 protein [Homo sa. 
IBAB02780.il (AB023036) dual-specificity protein . 
ICAA77232.il (Y18620) DsPTPl protein [Arabidopsis. 
I NP_004 411 . 1 1 dual specificity phosphatase 8 [Horn. 
I NP_032774 . 1 1 neuronal tyrosine/threonine phospha. 
AAF30304.1|AC018907_4 (AC018907) putative dual-sp. 
054 838 I DUS5 RAT DUAL SPECIFICITY PROTEIN PHOSPHAT. 



576 
340 
229 
162 
113 
113 
99 
96 
94 
92 



e-163 
6e-92 
le-58 
le-38 
8e-24 
8e-24 
3e-19 
2e-18 
6e-18 
2e-17 



SPLICE FORM 3: 

gi|8923483|ref |NP_060327.1| hypothetical protein FLJ20515 [Homo. 
H* gi|8922777|ref |NP_060746.1| hypothetical protein FLJ10928 [Homo. 

\^ gi|6714641|dbj IBAA89534.il (AB036834) MAP kinase phosphatase [D. 

gil7242951|dbj IBAA92536.il (AB037719) KIAA12 98 protein [Homosa. 
Iv: gi I 9294518 Idbj IBAB02780.il (AB023036) dual-specificity protein . 

H' gi I 4150963 I emb I CAA77232.il (Y18620) DsPTPl protein [Arabidopsis. 

H gi|6862915|gb|AAF30304.1|AC018907_4 (AC018907) putative dual-sp. 

gi I 6015037 I sp|054838 I DUS5_RAT DUAL SPECIFICITY PROTEIN PHOSPHAT. 

gi|9910432|ref |NP_064570.1| mitogen-activated protein kinase ph. 

gi|9911130|gb|AAA64693.2| (U15932) protein phosphatase [Homosa. 



410 
233 
224 
143 
113 
113 
94 
92 
90 
90 



e-113 
7e-60 
5e-57 
5e-33 
le-23 
le-23 
5e-18 
2e-17 
7e-17 
le-16 



BLAST to dbEST: 
SPLICE FORM 1: 





Score 




E 


gi 19807071 /dataset=dbest /taxon=960 . . . 


1404 


0 


.0 


gi 110317998 /dataset=dbest /taxon=96. . . 


1316 


0 


.0 


gi 110151079 /dataset=dbest /taxon=96. . . 


1249 


0 


.0 


gi 110401153 /dataset=dbest /taxon-960. . . 


1180 


0 


.0 


gi 110329921 /dataset=dbest /taxon=96. . . 


1124 


0 


.0 


gi 17632969 /dataset=dbest /taxon=960 . . . 


791 


0 


.0 


gi 19155111 /dataset=dbest /taxon=9606 . . . 


779 


0 


.0 


gi 110994242 /dataset=dbest /taxon=96. . . 


450 


e- 


-124 
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EXPRESSION INFORMATION FOR MODULATORY USE: 
SPLICE FORM 1: 

library source: 

Expression information from BLAST dbEST hits: 

gi I 9807071 Human Pancreas 

gi 1 10317998 H\aman colon adenocarcinoma 

gi 1 10151079 Human Pancreas : adenocarcinoma 

gi 1 10401153 Human Pancreas : epitheliod carcinoma 

gi 1 10329921 Human lung: large cell carcinoma 

gi 1 7632969 Human kidney: renal cell carcinoma 

gi I 9155111 Human Placenta choriocarcinoma 

gi 1 10994242 Human ovary tumor tissue 



Expression information from PCR-based tissue screening panels: 

Human Brain 

Human Fetal brain 

Human fetal heart 

Human fetal kidney 

Human heart 

Human kidney 

Human uterus 

Human thyroid 



iH 

n 

O 
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SPLICE FORM.l: 

1 MALVTVSRSP PGSGASTPVG PWDQAVQRRS RLQRRQSFAV LRGAVLGLQD 
51 GGDNDDAAEA SSEPTEKAPS EEELHGDQTD FGQGSQSPQK QEEQRQHLHL 
101 MVQLLRPQDD IRLAAQLEAP RPPRLRYLLV VSTREGEGLS QDETVLLGVD 
151 FPDSSSPSCT LGLVLPLWSD TQVYLDGDGG FSVTSGGQSR IFKPISIQTM 
201 WATLQVLHQA CEAALGSGLV PGGSALTWAS HYQERLNSEQ SCLNEWTAMA 
251 DLESLRPPSA EPGGSSEQEQ MEQAIRAELW KVLDVSDLES VTSKEIRQAL 
301 ELRLGLPLQQ YRDFIDNQML LLVAQRDRAS RIFPHLYLGS EWNAANLEEL 
351 QRNRVTHILN MAREIDNFYP ERFTYHNVRL WDEESAQLLP HWKETHRFIE 
401 AARAQGTHVL VHCKMGVSRS AATVLAYAMK QYECSLEQAL RHVQELRPIA 
451 RPNPGFLRQL QIYQGILTAR T ^^^^^^ ^ . 

SPLICE FORM 2: 



1 


MALVTVSRSP 


PGSGASTPVG 


51 


GGDNDDAAEA 


SSEPTEKAPS 


101 


MVQLLRPQDD 


IRLAAQLEAP 


151 


FPDSSSPSCT 


LGLVLPLWSD 


201 


WATLQVLHQA 


CEAALGSGLV 


251 


DLESLRPPSA 


EPGGSSEQEQ 


301 


ELRLGLPLQQ 


YRDFIDNQML 


351 


QRNRVTHILN 


MAREIDNFYP 


401 


AARAQGTHVL 


VHCKMGVSRS 


451 


RPNPGFLRQL 


QIYQGILTAS 


501 


PPEPEGGGEE 


KWGMEESQA 


551 


ESTSETSDMP 


EVFSSHESSH 


601 


QSWTLQGSA 


WANRTQAFQ 


651 


HDSGEEGEA 





SPLICE FORM 3: 



1 


MALVTVSRSP 


PGSGASTPVG 


51 


GGDNDDAAEA 


SSEPTEKAPS 


101 


MVQLLRPQDD 


IRLAAQLEAP 


151 


FPDSSSPSCT 


LGLVLPLWSD 


201 


WSSEQEQMEQ 


AIRAELWKVL 


251 


FIDNQMLLLV 


AQRDRASRIF 


301 


EIDNFYPERF 


TYHNVRLWDE 


351 


KMGVSRSAAT 


VLAYAMKQYE 


401 


QGILTART 





FEATURES : 

Functional domains and key regions : 
SPLICE FORM 1: 

[1] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 



34-37 RRQS 



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 



Number of matches: 6 

1 65-67 TEK 

2 132-134 STR 

3 254-256 SLR 

4 292-294 TSK 

5 395-397 THR 

6 468-470 TAR 
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[3] PDOC00006 PS00006 CK2_PH0SPH0_SITE 
Casein kinase II phosphorylation site 

Number of matches: 6 

1 70-73 SEEE 

2 132-135 STRE 

3 140-143 SQDE 

4 266-269 SEQE 

5 286-289 SOLE 

6 292-295 TSKE 

[4] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

363-369 REIDNFY 

[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Niomber of matches: 8 



1 


12 


-17 


GSGAST 


2 


43 


-48 


GAVLGL 


3 


47 


-52 


GLQDGG 


4 


218- 


223 


GLVPGG 


5 


223- 


228 


GSALTW 


6 


339- 


344 


GSEWNA 


7 


416- 


421 


GVSRSA 


8 


465- 


470 


GILTAR 
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BLAST Alignment to Top Hit: 
SPLICE FORM 1: 

>gi|8923483|ref |NP_060327.1| hypothetical protein FLJ20515 

>gi 17020674 Idbj IBAA91228. II (AK000522) unnamed protein 
product [Homo sapiens] 
Length = 394 

Score = 576 bits (1469), Expect - e-163 

Identities = 290/312 (92%), Positives = 296/312 (93%), Gaps = 1/312 (0%) 

Query: 1 MALVTVSRSPPGSGASTPVGPWDQAVQRRSRLQRRQSFAVLRGAVLGLQDGGDNDDAAEA 60 

MALVTVSRSPPGSGASTPVGPWDQAVQRRSRLQRRQSFAVLRGAVLGLQDGGDNDDA7VEA 
Sbjct: 1 MALVTVSRSPPGSGASTPVGPWDQAVQRRSRLQRRQSFAVLRGAVLGLQDGGDNDDAAEA 60 

Query: 61 SSEPTEKAPSEEELHGDQTDFGQGSQSPQKQEEQRQHLHLMVQLLRPQDDIRLAAQLEAP 120 

SSEPTEKAPSEEELHGDQTDFGQGSQSPQKQEEQRQHLHLMVQLLRPQDDIRLAAQLEAP 
Sbjct: 61 SSEPTEKAPSEEELHGDQTDFGQGSQSPQKQEEQRQHLHLMVQLLRPQDDIRLAAQLEAP 120 

Query: 121 RPPRLRYLLWSTREGEGLSQDETVLLGVDFPDSSSPSCTLGLVLPLWSDTQVYLDGDGG 180 

RPPRLRYLLWSTREGEGLSQDETVLLGVDFPDSSSPSCTLGLVLPLWSDTQVYLDGDGG 
Sbjct: 121 RPPRLRYLLWSTREGEGLSQDETVLLGVDFPDSSSPSCTLGLVLPLWSDTQVYLDGDGG 180 

Query: 181 FSVTSGGQSRIFKPISIQTMWATLQVLHQACEAALGSGLVPGGSALTWASHYQERLNSEQ 240 

FSVTSGGQSRIFKPISIQTMWATLQVLHQACEAALGSGLVPGGSALTWASHYQERLNSEQ 
Sbjct: 181 FSVTSGGQSRIFKPISIQTMWATLQVLHQACEAALGSGLVPGGSALTWASHYQERLNSEQ 240 

Query: 241 SCLNEWTAMADLESLRPPSAEPGGSSEQEQMEQAIRAELWKVLDV-SDLESVTSKEIRQA 2 99 

SCLNEWTAMADLESLRPPSAEPGGSSEQEQMEQAIRAELWKVL++ S E+ E+ + 
Sbjct: 241 SCLNEWTAMADLESLRPPSAEPGGSSEQEQMEQAXRAELWKVLELESTSETSDMPEVFSS 300 

Query: 300 LELRLGLPLQQY 311 

E PLQ + 

Sbjct: 301 HESSHEEPLQPF 312 



>gi I 6714641 Idbj IBAA89534.il (AB036834) MAP kinase phosphatase 
[Drosophila melanogaster ] 
Length = 1045 

Score = 337 bits (854), Expect = 2e-91 

Identities = 204/537 (37%), Positives = 284/537 (51%), Gaps - 81/537 (15%) 

MALVTVSRSPPGSGA-STPVGPWDQAVQRRSRLQRRQSFAVLRGAVLGLQDGGDNDDAAE 59 
MALVTV RSP +G+ SG+ R + F+GLL 
MALVTVQRSPSVAGSCSNSDGESEDDEGNSKGNDRSECFFAGKGTALVL 4 9 



SE L D T +QS + + HL M LL+ +D +++A +LE+ 



?RLRYLLV VSTREGEGLSQDETVLLGVDFPDSSSPS 158 

R RYL++ S R + + +V +G SS S 



Query: 


1 


Sbjct: 


1 


Query: 


60 


Sbjct: 


50 


Query: 


120 


Sbjct: 


108 


Query: 


159 


Sbjct: 


168 


Query: 


175 


Sbjct: 


228 



-CTLGL VLPLWSDTQVY 174 

C LG+ V+P+ +DT ++ 



LDGDGGFSV ++ IFKP+S+Q MW+ LQ LH+ +A+ G+WSY+ 
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Query: 235 RLNSEQSCLNEWTAMADLESLRPPSAEP—GGSSEQEQMEQAIRAELWKVLDVSDLESVT 292 

R+ S+QSCLNEW AM LES RPPS + E+E+ E 1+ +L ++ DL+ VT 

Sbjct: 288 RIESDQSCLNEWNAMDALESRRPPSPDAIRNKPPEKEETESVIKMKLKAIMMSVDLDEVT 347 

Query: 293 SKEIRQALELRLGLPLQQYRDFIDNQMLLLVAQRDRASRIFPHLYLGSEWNAANLEELQR 352 

SK IR LE L + L +Y+ FID -fML+++ Q D ++IF H+YLGSEWNA+NLEELQ+ 
Sbjct: 348 SKYIRGRLEEILDMDLGEYKSFIDAEMLVILGQMDAPTKIFEHVYLGSEWNASNLEELQK 407 

Query: 353 NRVTHILNMAREIDNFYPERFTYHNVRLWDEESAQLLPHWKETHRFIETIARAQGTHVLVH 412 

N V HILN+ REIDNF+P F Y NVR++D+E LL +W +T R+I A+A+G+ VLVH 
Sbjct: 408 NGVRHILNVTREIDNFFPGTFEYFNVRVYDDEKTNLLKYWDDTFRYITRAKAEGSKVLVH 4 67 

Query: 413 CKMGVSRSAATVLAYAMKQYECSLEQALRHVQELRPIARPNPGFLRQLQIYQGILTA 4 69 

CKMGVSRSA+ V+AYAMK Y+ +QAL HV++ R +PN FL QL+ Y G+L A 
Sbjct: 4 68 CKMGVSRSASWIAYAMKAYQWEFQQALEHVKKRRSCIKPNKNFLNQLETYSGMLDA 524 



>gi|8922777 Iref |NP_060746.1| hypothetical protein FLJ10928 

>gi 17023283 I dbj I BAA91913.1! (AK001790} unnamed protein 
product [Homo sapiens] 
Length = 141 

Score = 233 bits (588), Expect = 3e-60 

Identities = 111/111 (100%), Positives = 111/111 (100%) 

Query: 361 MAREIDNFYPERFTYHNVRLWDEESAQLLPHWKETHRFIEAARAQGTHVLVHCKMGVSRS 420 

MAREIDNFYPERFTYHNVRLWDEESAQLLPHWKETHRFIEAARAQGTHVLVHCKMGVSRS 
Sbjct: 31 MAREIDNFYPERFTYHNVRLWDEESAQLLPHWKETHRFIE7VARAQGTHVLVHCKMGVSRS 90 

Query: 421 AATVLAYAMKQYECSLEQALRHVQELRPIARPNPGFLRQLQIYQGILTART 471 

AAT VLAYAMKQ YE CS LEQALRH VQE LRP I ARPN PG FLRQLQ I YQG I LT ART 
Sbjct: 91 AATVLAYAMKQYECSLEQALRHVQELRPIARPNPGFLRQLQIYQGILTART 141 
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1 CGTCCTTCCT GGTCCTGCGG GTCCAGGACT GTCCGCGGGG TTGAGGGAAG 

51 GGGCCGTGCC CGGTGCCAGC CCAGGTGCTC GCGGCCTGGC TCCATGGCCC 

101 TGGTCACAGT GAGCCGTTCG CCCCCGGGCA GCGGCGCCTC CACGCCCGTG 

151 GGGCCCTGGG ACCAGGCGGT CCAGCGAAGG AGTCGACTCC AGCGAAGGCA 

201 GAGCTTTGCG GTGCTCCGTG GGGCTGTCCT GGGACTGCAG GATGGAGGGG 

251 ACAATGATGA TGCAGCAGAG GCCAGTTCTG AGCCAACAGA GAAGGCCCCG 

301 AGTGAGGAGG AGCTCCACGG GGACCAGACA GACTTCGGGC AAGGATCCCA 

351 GAGTCCCCAG 7\AGCAGGAGG AGCAGAGGCA GCACCTGCAC CTCATGGTAC 

401 AGCTGCTGAG GCCGCAGGAT GACATCCGCC TGGCAGCCCA GCTGGAGGCA 

451 CCCCGGCCTC CCCGGCTCCG CTACCTGCTG GTAGTTTCTA CACGAGAAGG 

501 AGAAGGTCTG AGCCAGGATG AGACGGTCCT CCTGGGCGTG GATTTCCCTG 

551 ACAGCAGCTC CCCCAGCTGC ACCCTGGGCC TGGTCTTGCC CCTCTGGAGT 

601 GACACCCAGG TGTACTTAGA TGGAGACGGG GGCTTCAGCG TGACGTCTGG 

651 TGGGCAAAGC CGGATCTTCA AGCCCATCTC CATCCAGACC ATGTGGGCCA 

701 CACTCCAGGT ATTGCACCAA GCATGTGAGG CAGCTCTAGG CAGCGGCCTT 

751 GTACCGGGTG GCAGTGCCCT CACCTGGGCC AGCCACTACC AGGAGAGACT 

801 GAACTCCGAA CAGAGCTGCC TCAATGAGTG GACGGCTATG GCCGACCTGG 

851 AGTCTCTGCG GCCTCCCAGC GCCGAGCCTG GCGGGTCCTC AGAACAGGAG 

901 CAGATGGAGC AGGCGATCCG TGCTGAGCTG TGGAAAGTGT TGGATGTCAG 

951 TGACCTGGAG AGTGTCACTT CCAAAGAGAT CCGCCAGGCT CTGGAGCTGC 

1001 GCCTGGGGCT CCCCCTCCAG CAGTACCGTG ACTTCATCGA CAACCAGATG 

1051 CTGCTGCTGG TGGCACAGCG GGACCGAGCC TCCCGCATCT TCCCCCACCT 

1101 CTACCTGGGC TCAGAGTGGA ACGCAGCAAA CCTGGAGGAG CTGCAGAGGA 

1151 ACAGGGTCAC CCACATCTTG AACATGGCCC GGGAGATTGA CAACTTCTAC 

1201 CCTGAGCGCT TCACCTACCA CAATGTGCGC CTCTGGGATG AGGAGTCGGC 

1251 CCAGCTGCTG CCGCACTGGA AGGAGACGCA CCGCTTCATT GAGGCTGCAA 

1301 GAGCACAGGG CACCCACGTG CTGGTCCACT GCAAGATGGG CGTCAGCCGC 

1351 TCAGCGGCCA CAGTGCTGGC CTATGCCATG AAGCAGTACG AATGCAGCCT 

14 01 GGAGCAGGCC CTGCGCCACG TGCAGGAGCT CCGGCCCATC GCCCGCCCCA 

14 51 ACCCTGGCTT CCTGCGCCAG CTGCAGATCT ACCAGGGCAT CCTGACGGCC 

1501 AGAACCTGAG GGTGGTGGGG AGGAGAAGGT TGTAGGCATG GAAGAGAGCC 

1551 AGGCAGCCCC GAAAGAAGAG CCTGGGCCAC GGCCACGTAT AAACCTCCGA 

1601 GGGGTCATGA GGTCCATCAG TCTTCTGGAG CCCTCCTTGG AGCTGGAGAG 

1651 CACCTCAGAG ACCAGTGACA TGCCAGAGGT CTTCTCTTCC CACGAGTCTT 

1701 CACATGAAGA GCCTCTGCAG CCCTTCCCAC AGCTTGCAAG GACCAAGGGA 

1751 GGCCAGCAGG TGGACAGGGG GCCTCAGCCT GCCCTGAAGT CCCGCCAGTC 

1801 AGTGGTTACC CTCCAGGGCA GTGCCGTGGT GGCCAACCGG ACCCAGGCCT 

1851 TCCAGGAGCA GGAGCAGGGG CAGGGGCAGG GGCAGGGAGA GCCCTGCATT 

1901 TCCTCTACGC CCAGGTTCCG GAAGGTGGTG AGACAGGCCA GCGTGCATGA 

1951 CAGTGGAGAG GAGGGCGAGG CCTGAGCGCT CACACATGCC CACGCTCCCC 

2001 TGACACTGAA GAGGATCCAC AACTCCTTGG AGAAACACCC TCACGTCTGT 

2051 TGCCGCACAC ATTCCTCTCA GCTCCGCCCC ATACCCGTCA CTACAGCCTC 

2101 ACCTCCCACC CCTGTCACTA CGGCCTCACC TCCCACCCCT GTCACTACAG 

2151 CCTCACCTCC TACAGCCTTA AGTCCCAGGC CCATGTCTGC CTGTCCAAGG 

2201 GCTCAAGACT TTCTAACTGG GATGTGGTAG AGGGACTGAA GGTACCTTTG 

2251 GGGGCAACAG CACCCTAGTT TCATTCTCAA CTCTAGCCCT GCACACTCAC 

2301 CTGTGGCACG GAATGAAAAC AGAGCTTCCC GTGCAAAAAG GGTCACGCCT 

2351 CCCACCCCCG CCCCCTCCCT GCACCTCCTG TCCTCTCCCA GTTCATTCCT 

24 01 GGAACCAGCC AGGCCAGGCA ACCAGTGGCC CCCAAAGGCA GGCAGGATCC 

2 4 51 TCAGGCCCCA GCCGCGGGAG GCTGGAAGGG CTGGCAGATC GCTTCCCTCA 

2501 TCCACCTCCA CCGGTCCAGG TCTTTGCTGC TGTCCCCAGA CCTCCTGTGA 

2551 CACCACGCCA GATCACAGGG CACCAGGCCA GAGATAGTCT TCTTTTTGTC 

2 601 CTTTCTGGCC TCTGGCTAGT CAGTTTTTCA TAGCCTTACA GTATCTGGCT 

2 651 TTGTACTGAG AAATAAAACA CATTTTCATA AAAAAAAAAA AAAAAAAAAA 

2701 AAAA 



FEATURES : 

Start: 94 
Exon: 94-1506 
Stop: 1507 
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CHROMOSOME MAP POSITION: 

Bac accession number: AP001885 
Chromosome # : 11 ' 



01 



ALLELIC VARIANTS (SNPs) : 

DNA Protein 

Position Major Minor Domain Position Major Minor 

577 G A Exon 162 G S 

1451 G A Exon 453 S N 

2641 G A Beyond 0RF(3 • ) 

Context : 

DNA 

Position 

577 TCTGAGCCAACAGAGAAGGCCCCGAGTGAGGAGGAGCTCCACGGGGACCAGACAGACTTC 
GGGCAAGGATCCCAGAGTCCCCAGAAGCAGGAGGAGCAGAGGCAGCACCTGCACCTCATG 
GTACAGCTGCTGAGGCCGCAGGATGACATCCGCCTGGCAGCCCAGCTGGAGGCACCCCGG 
CCTCCCCGGCTCCGCTACCTGCTGGTAGTTTCTACACGAGAAGGAGAAGGTCTGAGCCAG 
GATGAGACGGTCCTCCTGGGCGTGGATTTCCCTGACAGCAGCTCCCCCAGCTGCACCCTG 
[G.A] 

GCCTGGTCTTGCCCCTCTGGAGTGACACCCAGGTGTACTTAGATGGAGACGGGGGCTTCA 
GCGTGACGTCTGGTGGGCAAAGCCGGATCTTCAAGCCCATCTCCATCCAGACCATGTGGG 
CCACACTCCAGGTATTGCACCAAGCATGTGAGGCAGCTCTAGGCAGCGGCCTTGTACCGG 
GTGGCAGTGCCCTCACCTGGGCCAGCCACTACCAGGAGAGACTGAACTCCGAACAGAGCT 
GCCTCAATGAGTGGACGGCTATGGCCGACCTGGAGTCTCTGCGGCCTCCCAGCGCCGAGC 

14 51 ACAGGGTCACCCACATCTTGAACATGGCCCGGGAGATTGACAACTTCTACCCTGAGCGCT 
TCACCTACCACAATGTGCGCCTCTGGGATGAGGAGTCGGCCCAGCTGCTGCCGCACTGGA 



ffl AGGAGACGCACCGCTTCATTGAGGCTGCAAGAGCACAGGGCACCCACGTGCTGGTCCACT 
GCAAGATGGGCGTCAGCCGCTCAGCGGCCACAGTGCTGGCCTATGCCATGAAGCAGTACG 
i-^ AATGCAGCCTGGAGCAGGCCCTGCGCCACGTGCAGGAGCTCCGGCCCATCGCCCGCCCCA 
[G,A] 

CCCTGGCTTCCTGCGCCAGCTGCAGATCTACCAGGGCATCCTGACGGCCAGAACCTGAGG 
Q GTGGTGGGGAGGAGAAGGTTGTAGGCATGGAAGAGAGCCAGGCAGCCCCGAAAGAAGAGC 

CTGGGCCACGGCCACGTATAAACCTCCGAGGGGTCATGAGGTCCATCAGTCTTCTGGAGC 
1^ CCTCCTTGGAGCTGGAGAGCACCTCAGAGACCAGTGACATGCCAGAGGTCTTCTCTTCCC 

ACGAGTCTTCACATGAAGAGCCTCTGCAGCCCTTCCCACAGCTTGCAAGGACCAAGGGAG 

13 2641 GGTCACGCCTCCCACCCCCGCCCCCTCCCTGCACCTCCTGTCCTCTCCCAGTTCATTCCT 

14= GGAACCAGCCAGGCCAGGCAACCAGTGGCCCCCAAAGGCAGGCAGGATCCTCAGGCCCCA 
GCCGCGGGAGGCTGGAAGGGCTGGCAGATCGCTTCCCTCATCCACCTCCACCGGTCCAGG 
TCTTTGCTGCTGTCCCCAGACCTCCTGTGACACCACGCCAGATCACAGGGCACCAGGCCA 
GAGATAGTCTTCTTTTTGTCCTTTCTGGCCTCTGGCTAGTCAGTTTTTCATAGCCTTACA 
[G,A] 

TATCTGGCTTTGTACTGAGAAATAAAACACATTTTCATAAAAAAAAAAAAAAAAAAAAAA 
AAA 



GENE STRUCTURE MODEL: 

SPLICE FORM 2 5 ' -el-e2-e3-e4-e5-e6-e7-e8-e9-el0-ell-el2-el3-el4 -3 ' 

SPLICE FORM 1 5'-el-e2-e3-e4-e5-e6-e7-e8-e9-el0-ell-el2 3' 

SPLICE FORM 3 5 ' -el-e2 -e3 -e4 -e5-e6 e8-e9-el0-ell-el2 3' 



MULTIPLE ALIGNMENT OF CDNA SEQUENCES: 

SPLICE FORM 2 TG GTTGAGGGAA 

SPLICE FORM 1 CGTCCTTCCT GGTCCTGCGG GTCCAGGACT GT.CCGCGGG GTTGAGGGAA 
SPLICE FORM 3 CCT GGTCCTGCGG GTCCAGGACT GTCCCGCGGG GTTGAGGGAA 

51 100 
SPLICE FORM 2 GGGGCCGTGC CCGGTGCCAG CCCAGGTGCT CGCGGCCTGG CTCCATGGCC 
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9 • 

SPLICE FORM 1 GGGGCCGTGC CCGGTGCCAG CCCAGGTGCT CGCGGCCTGG CTCCATGGCC 
SPLICE FORM 3 GGGGCCGTGC CCGGTGCCAG CCCAGGTGCT CGCGGCCTGG CTCCATGGCC 



101 

SPLICE FORM 2 CTGGTCACAG 
SPLICE FORM 1 CTGGTCACAG 
SPLICE FORM 3 CTGGTCACAG 



TGAGCCGTTC GCCCCCGGGC 
TGAGCCGTTC GCCCCCGGGC 
TGAGCCGTTC GCCCCCGGGC 



150 

AGCGGCGCCT CCACGCCCGT 
AGCGGCGCCT CCACGCCCGT 
AGCGGCGCCT CCACGCCCGT 



151 

SPLICE FORM 2 GGGGCCCTGG 
SPLICE FORM 1 GGGGCCCTGG 
SPLICE FORM 3 GGGGCCCTGG 



GACCAGGCGG TCCAGCGAAG 
GACCAGGCGG TCCAGCGAAG 
GACCAGGCGG TCCAGCGAAG 



200 

GAGTCGACTC CAGCGAAGGC 
GAGTCGACTC CAGCGAAGGC 
GAGTCGACTC CAGCGAAGGC 



201 

SPLICE FORM 2 AGAGCTTTGC 
SPLICE FORM 1 AGAGCTTTGC 
SPLICE FORM 3 AGAGCTTTGC 

251 

SPLICE FORM 2 GACAATGATG 
SPLICE FORM 1 GACAATGATG 
SPLICE FORM 3 GACAATGATG 



GGTGCTCCGT GGGGCTGTCC 
GGTGCTCCGT GGGGCTGTCC 
GGTGCTCCGT GGGGCTGTCC 



ATGCAGCAGA GGCCAGTTCT 
ATGCAGCAGA GGCCAGTTCT 
ATGCAGCAGA GGCCAGTTCT 



250 

TGGGACTGCA GGATGGAGGG 
TGGGACTGCA GGATGGAGGG 
TGGGACTGCA GGATGGAGGG 

300 

GAGCCAACAG AGAAGGCCCC 
GAGCCAACAG AGAAGGCCCC 
GAGCCAACAG AGAAGGCCCC 



301 350 
SPLICE FORM 2 GAGTGAGGAG GAGCTCCACG GGGACCAGAC AGACTTCGGG CAAGGATCCC 
SPLICE FORM 1 GAGTGAGGAG GAGCTCCACG GGGACCAGAC AGACTTCGGG CAAGGATCCC 
SPLICE FORM 3 GAGTGAGGAG GAGCTCCACG GGGACCAGAC AGACTTCGGG CAAGGATCCC 



IB 



351 400 
SPLICE FORM 2 AGAGTCCCCA GAAGCAGGAG GAGCAGAGGC AGCACCTGCA CCTCATGGTA 
SPLICE FORM 1 AGAGTCCCCA GAAGCAGGAG GAGCAGAGGC AGCACCTGCA CCTCATGGTA 
SPLICE FORM 3 AGAGTCCCCA GAAGCAGGAG GAGCAGAGGC AGCACCTGCA CCTCATGGTA 

401 450 
SPLICE FORM 2 CAGCTGCTGA GGCCGCAGGA TGACATCCGC CTGGCAGCCC AGCTGGAGGC 
SPLICE FORM 1 CAGCTGCTGA GGCCGCAGGA TGACATCCGC CTGGCAGCCC AGCTGGAGGC 
SPLICE FORM 3 CAGCTGCTGA GGCCGCAGGA TGACATCCGC CTGGCAGCCC AGCTGGAGGC 

451 500 
SPLICE FORM 2 ACCCCGGCCT CCCCGGCTCC GCTACCTGCT GGTAGTTTCT ACACGAGAAG 
SPLICE FORM 1 ACCCCGGCCT CCCCGGCTCC GCTACCTGCT GGTAGTTTCT ACACGAGAAG 
SPLICE FORM 3 ACCCCGGCCT CCCCGGCTCC GCTACCTGCT GGTAGTTTCT ACACGAGAAG 



501 

SPLICE FORM 2 GAGAAGGTCT 
SPLICE FORM 1 GAGAAGGTCT 
SPLICE FORM 3 GAGAAGGTCT 



GAGCCAGGAT GAGACGGTCC 
GAGCCAGGAT GAGACGGTCC 
GAGCCAGGAT GAGACGGTCC 



550 

TCCTGGGCGT GGATTTCCCT 
TCCTGGGCGT GGATTTCCCT 
TCCTGGGCGT GGATTTCCCT 



551 

SPLICE FORM 2 GACAGCAGCT 
SPLICE FORM 1 GACAGCAGCT 
SPLICE FORM 3 GACAGCAGCT 



CCCCCAGCTG CACCCTGGGC 
CCCCCAGCTG CACCCTGGGC 
CCCCCAGCTG CACCCTGGGC 



600 

CTGGTCTTGC CCCTCTGGAG 
CTGGTCTTGC CCCTCTGGAG 
CTGGTCTTGC CCCTCTGGAG 



601 

SPLICE FORM 2 TGACACCCAG 
SPLICE FORM 1 TGACACCCAG 
SPLICE FORM 3 TGACACCCAG 



GTGTACTTAG ATGGAGACGG 
GTGTACTTAG ATGGAGACGG 
GTGTACTTAG ATGGAGACGG 



650 

GGGCTTCAGC GTGACGTCTG 
GGGCTTCAGC GTGACGTCTG 
GGGCTTCAGC GTGACGTCTG 



651 

SPLICE FORM 2 GTGGGCAAAG 
SPLICE FORM 1 GTGGGCAAAG 
SPLICE FORM 3 GTGGGCAAAG 



CCGGATCTTC AAGCCCATCT 
CCGGATCTTC AAGCCCATCT 
CCGGATCTTC AAGCCCATCT 



700 

CCATCCAGAC CATGTGGGCC 
CCATCCAGAC CATGTGGGCC 
CCATCCAGAC CATGT 
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701 750 
SPLICE FORM 2 ACACTCCAGG TATTGCACCA AGCATGTGAG GCAGCTCTAG GCAGCGGCCT 
SPLICE FORM 1 ACACTCCAGG TATTGCACCA AGCATGTGAG GCAGCTCTAG GCAGCGGCCT 
SPLICE FORM 3 

751 800 
SPLICE FORM 2 TGTACCGGGT GGCAGTGCCC TCACCTGGGC CAGCCACTAC CAGGAGAGAC 
SPLICE FORM 1 TGTACCGGGT GGCAGTGCCC TCACCTGGGC CAGCCACTAC CAGGAGAGAC 
SPLICE FORM 3 

801 850 
SPLICE FORM 2 TGAACTCCGA ACAGAGCTGC CTCAATGAGT GGACGGCTAT GGCCGACCTG 
SPLICE FORM 1 TGAACTCCGA ACAGAGCTGC CTCAATGAGT GGACGGCTAT GGCCGACCTG 
SPLICE FORM 3 

851 900 
SPLICE FORM 2 GAGTCTCTGC GGCCTCCCAG CGCCGAGCCT GGCGGGTCCT CAGAACAGGA 
SPLICE FORM 1 GAGTCTCTGC GGCCTCCCAG CGCCGAGCCT GGCGGGTCCT CAGAACAGGA 
SPLICE FORM 3 GGTCCT CAGAACAGGA 

901 950 

SPLICE FORM 2 GCAGATGGAG CAGGCGATCC GTGCTGAGCT GTGGAAAGTG TTGGATGTCA 

SPLICE FORM 1 GCAGATGGAG CAGGCGATCC GTGCTGAGCT GTGGAAAGTG TTGGATGTCA 

1^ SPLICE FORM 3 GCAGATGGAG CAGGCGATCC GTGCTGAGCT GTGGAAAGTG TTGGATGTCA 

951 1000 
""'4 SPLICE FORM 2 GTGACCTGGA GAGTGTCACT TCCAAAGAGA TCCGCCAGGC TCTGGAGCTG 

m SPLICE FORM 1 GTGACCTGGA GAGTGTCACT TCCAAAGAGA TCCGCCAGGC TCTGGAGCTG 

SPLICE FORM 3 GTGACCTGGA GAGTGTCACT TCCAAAGAGA TCCGCCAGGC TCTGGAGCTG 

1001 1050 

M SPLICE FORM 2 CGCCTGGGGC TCCCCCTCCA GCAGTACCGT GACTTCATCG ACAACCAGAT 

p SPLICE FORM 1 CGCCTGGGGC TCCCCCTCCA GCAGTACCGT GACTTCATCG ACAACCAGAT 

SPLICE FORM 3 CGCCTGGGGC TCCCCCTCCA GCAGTACCGT GACTTCATCG ACAACCAGAT 

1051 1100 

H SPLICE FORM 2 GCTGCTGCTG GTGGCACAGC GGGACCGAGC CTCCCGCATC TTCCCCCACC 

SPLICE FORM 1 GCTGCTGCTG GTGGCACAGC GGGACCGAGC CTCCCGCATC TTCCCCCACC 

m SPLICE FORM 3 GCTGCTGCTG GTGGCACAGC GGGACCGAGC CTCCCGCATC TTCCCCCACC 

^ 1101 1150 

SPLICE FORM 2 TCTACCTGGG CTCAGAGTGG AACGCAGCAA ACCTGGAGGA GCTGCAGAGG 

SPLICE FORM 1 TCTACCTGGG CTCAGAGTGG AACGCAGCAA ACCTGGAGGA GCTGCAGAGG 

SPLICE FORM 3 TCTACCTGGG CTCAGAGTGG AACGCAGCAA ACCTGGAGGA GCTGCAGAGG 

1151 1200 
SPLICE FORM 2 AACAGGGTCA CCCACATCTT GAACATGGCC CGGGAGATTG ACAACTTCTA 
SPLICE FORM 1 AACAGGGTCA CCCACATCTT GAACATGGCC CGGGAGATTG ACAACTTCTA 
SPLICE FORM 3 AACAGGGTCA CCCACATCTT GAACATGGCC CGGGAGATTG ACAACTTCTA 

1201 1250 
SPLICE FORM 2 CCCTGAGCGC TTCACCTACC ACAATGTGCG CCTCTGGGAT GAGGAGTCGG 
SPLICE FORM 1 CCCTGAGCGC TTCACCTACC ACAATGTGCG CCTCTGGGAT GAGGAGTCGG 
SPLICE FORM 3 CCCTGAGCGC TTCACCTACC ACAATGTGCG CCTCTGGGAT GAGGAGTCGG 

1251 1300 
SPLICE FORM 2 CCCAGCTGCT GCCGCACTGG AAGGAGACGC ACCGCTTCAT TGAGGCTGCA 
SPLICE FORM 1 CCCAGCTGCT GCCGCACTGG AAGGAGACGC ACCGCTTCAT TGAGGCTGCA 
SPLICE FORM 3 CCCAGCTGCT GCCGCACTGG AAGGAGACGC ACCGCTTCAT TGAGGCTGCA 

1301 1350 
SPLICE FORM 2 AGAGCACAGG GCACCCACGT GCTGGTCCAC TGCAAGATGG GCGTCAGCCG 
SPLICE FORM 1 AGAGCACAGG GCACCCACGT GCTGGTCCAC TGCAAGATGG GCGTCAGCCG 
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SPLICE FORM 3 AGAGCACAGG GCACCCACGT GCTGGTCCAC TGCAAGATGG GCGTCAGCCG 

1351 1400 
SPLICE FORM 2 CTCAGCGGCC ACAGTGCTGG CCTATGCCAT GAAGCAGTAC GAATGCAGCC 
SPLICE FORM 1 CTCAGCGGCC ACAGTGCTGG CCTATGCCAT GAAGCAGTAC GAATGCAGCC 
SPLICE FORM 3 CTCAGCGGCC ACAGTGCTGG CCTATGCCAT GAAGCAGTAC GAATGCAGCC 

1401 1450 
SPLICE FORM 2 TGGAGCAGGC CCTGCGCCAC GTGCAGGAGC TCCGGCCCAT CGCCCGCCCC 
SPLICE FORM 1 TGGAGCAGGC CCTGCGCCAC GTGCAGGAGC TCCGGCCCAT CGCCCGCCCC 
SPLICE FORM 3 TGGAGCAGGC CCTGCGCCAC GTGCAGGAGC TCCGGCCCAT CGCCCGCCCC 

1451 1500 
SPLICE FORM 2 AACCCTGGCT TCCTGCGCCA GCTGCAGATC TACCAGGGCA TCCTGACGGC 
SPLICE FORM 1 AACCCTGGCT TCCTGCGCCA GCTGCAGATC TACCAGGGCA TCCTGACG. . 
SPLICE FORM 3 AACCCTGGCT TCCTGCGCCA GCTGCAGATC TACCAGGGCA TCCTGACG.. 



1501 1550 
SPLICE FORM 2 CAGCCGCCAG AGCCATGTCT GGGAGCAGAA AGTGGGTGGG GTCTCCCCAG 

SPLICE FORM 1 

SPLICE FORM 3 

1551 1600 
SPLICE FORM 2 AGGAGCACCC AGCCCCTGAA GTCTCTACAC CATTCCCACT TCTTCCGCCA 

SPLICE FORM 1 GCCA 

SPLICE FORM 3 GCCA 



1601 1650 
SPLICE FORM 2 GAACCTGAGG GTGGTGGGGA GGAGAAGGTT GTAGGCATGG AAGAGAGCCA 
SPLICE FORM 1 GAACCTGAGG GTGGTGGGGA GGAGAAGGTT GTAGGCATGG AAGAGAGCCA 
SPLICE FORM 3 GAACCTGAGG GTGGTGGGGA GGAGAAGGTT GTAGGCATGG AAGAGAGCCA 

1651 1700 
SPLICE FORM 2 GGCAGCCCCG AAAGAAGAGC CTGGG . . CCA CGGCCACGTA TAAACCTCCG 
SPLICE FORM 1 GGCAGCCCCG AAAGAAGAGC CTGGG.. CCA CGGCCACGTA TAAACCTCCG 
SPLICE FORM 3 GGCAGCCCCG AAAGAAGAGC CTGGGGCCAC GGGGCACGTA TAAACCTCCG 



1701 1750 
SPLICE FORM 2 AGGGGTCATG AGGTCCATCA GTCTTCTGGA GCCCTCCTT. GGAGCTGGAG 
SPLICE FORM 1 AGGGGTCATG AGGTCCATCA GTCTTCTGGA GCCCTCCTT. GGAGCTGGAG 
SPLICE FORM 3 AGGGGTCATG AGGTCCATCA GTCTTCTGGA GCCCTCCTTG GGAGCTGGAG 



1751 1800 
SPLICE FORM 2 AGCACCTCAG .AGACCAGTG ACATGCCAGA GGTCTTCTCT TCCCACGAGT 
SPLICE FORM 1 AGCACCTCAG .AGACCAGTG ACATGCCAGA GGTCTTCTCT TCCCACGAGT 
SPLICE FORM 3 AGCACCTCAG TAGACCAGTG ACATGCCAGA GGTCTTCTCT TCCCACGAGT 

1801 1850 
SPLICE FORM 2 CTTCACATGA AGAGCCTCTG CAGCCCTTCC CACAGCTTGC AAGGACCAAG 
SPLICE FORM 1 CTTCACATGA AGAGCCTCTG CAGCCCTTCC CACAGCTTGC AAGGACCAAG 
SPLICE FORM 3 CTTCACATGA AGAGCCTCTG CAGCCCTTCC CACAGCTTGC AAGGACCAAG 

1851 1900 
SPLICE' FORM 2 GGAGGCCAGC AGGTGGACAG GGGGCCTCAG CCTGCCCTGA AGTCCCGCCA 
SPLICE FORM 1 GGAGGCCAGC AGGTGGACAG GGGGCCTCAG CCTGCCCTGA AGTCCCGCCA 
SPLICE FORM 3 GGAGGCCAGC AGGTGGACAG GGGGCCTCAG CCTGCCCTGA AGTCCCGCCA 

1901 1950 
SPLICE FORM 2 GTCAGTGGTT ACCCTCCAGG GCAGTGCCGT GGTGGCCAAC CGGACCCAGG 
SPLICE FORM 1 GTCAGTGGTT ACCCTCCAGG GCAGTGCCGT GGTGGCCAAC CGGACCCAGG 
SPLICE FORM 3 GTCAGTGGTT ACCCTCCAGG GCAGTGCCGT GGTGGCCAAC CGGACCCAGG 



1951 



2000 
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SPLICE FORM 2 CCTTCCAGGA GCAGGAGCAG 
SPLICE FORM 1 CCTTCCAGGA GCAGGAGCAG 
SPLICE FORM 3 CCTTCCAGGA GCAGGAGCAG 

2001 

SPLICE FORM 2 ATTTCCTCTA CGCCCAGGTT 
SPLICE FORM 1 ATTTCCTCTA CGCCCAGGTT 
SPLICE FORM 3 ATTTCCTCTA CGCCCAGGTT 

2051 

SPLICE FORM 2 TGACAGTGGA GAGGAGGGCG 
SPLICE FORM 1 TGACAGTGGA GAGGAGGGCG 
SPLICE FORM 3 TGACAGTGGA GAGGAGGGCG 

2101 

SPLICE FORM 2 CCCTGACACT GAAGAGGATC 

SPLICE FORM 1 CCCTGACACT GAAGAGGATC 

SPLICE FORM 3 CCCTGACACT GAAGAGGATC 

2151 

SPLICE FORM 2 TGTTGCCGCA CACATTCCTC 
SPLICE FORM 1 TGTTGCCGCA CACATTCCTC 
SPLICE FORM 3 TGTTGCCGCA CACATTCCTC 

2201 

SPLICE FORM 2 CTCACCTCCC ACCCCTGTCA 

SPLICE FORM 1 CTCACCTCCC ACCCCTGTCA 

SPLICE FORM 3 CTCACCTCCC ACCCCTGTCA 

2251 

SPLICE FORM 2 CAGCCTCACC TCCTACAGCC 

SPLICE FORM 1 CAGCCTCACC TCCTACAGCC 

SPLICE FORM 3 CAGCCTCACC TCCTACAGCC 

2301 

SPLICE FORM 2 AGGGCTCAAG ACTTTCTAAC 
SPLICE FORM 1 AGGGCTCAAG ACTTTCTAAC 
SPLICE FORM 3 AGGGCTCAAG ACTTTCTAAC 

2351 

SPLICE FORM 2 TTGGGGGCAA CAGCACCCTA 
SPLICE FORM 1 TTGGGGGCAA CAGCACCCTA 
SPLICE FORM 3 TTGGGGGCAA CAGCACCCTA 

2401 

SPLICE FORM 2 CACCTGTGGC ACGGAATGAA 
SPLICE FORM 1 CACCTGTGGC ACGGAATGAA 
SPLICE FORM 3 CACCTGTGGC ACGGAATGAA 

2451 

SPLICE FORM 2 CCTCCCACCC CCGCCCCCTC 
SPLICE FORM 1 CCTCCCACCC CCGCCCCCTC 
SPLICE FORM 3 CCTCCCACCC CCGCCCCCTC 

2501 

SPLICE FORM 2 CCTGGAACCA GCCAGGCCAG 
SPLICE FORM 1 CCTGGAACCA GCCAGGCCAG 
SPLICE FORM 3 CCTGGAACCA GCCAGGCCAG 

2551 

SPLICE FORM 2 TCCTCAGGCC CCAGCCGCGG 

SPLICE FORM 1 TCCTCAGGCC CCAGCCGCGG 

SPLICE FORM 3 TCCTCAGGCC CCAGCCGCGG 



GGGCAGGGGC AGGGGCAGGG AGAGCCCTGC 
GGGCAGGGGC AGGGGCAGGG AGAGCCCTGC 
GGGCAGGGGC AGGGGCAGGG AGAGCCCTGC 

2050 

CCGGAAGGTG GTGAGACAGG CCAGCGTGCA 
CCGGAAGGTG GTGAGACAGG CCAGCGTGCA 
CCGGAAGGTG GTGAGACAGG CCAGCGTGCA 

2100 

AGGCCTGAGC CCTCACACAT GCCCACGCTC 
AGGCCTGAGC CCTCACACAT GCCCACGCTC 
AGGCCTGAGC CCTCACACAT GCCCACGCTC 

2150 

CACAACTCCT TGGAGAAACA CCCTCACGTC 
CACAACTCCT TGGAGAAACA CCCTCACGTC 
CACAACTCCT TGGAGAAACA CCCTCACGTC 

2200 

TCAGCTCCGC CCCATACCCG TCACTACAGC 
TCAGCTCCGC CCCATACCCG TCACTACAGC 
TCAGCTCCGC CCCATACCCG TCACTACAGC 

2250 

CTACGGCCTC ACCTCCCACC CCTGTCACTA 
CTACGGCCTC ACCTCCCACC CCTGTCACTA 
CTACGGCCTC ACCTCCCACC CCTGTCACTA 

2300 

TTAAGTCCCA GGCCCATGTC TGCCTGTCCA 
TTAAGTCCCA GGCCCATGTC TGCCTGTCCA 
TTAAGTCCCA GGCCCATGTC TGCCTGTCCA 

2350 

TGGGATGTGG TAGAGGGACT GAAGGTACCT 
TGGGATGTGG TAGAGGGACT GAAGGTACCT 
TGGGATGTGG TAGAGGGACT GAAGGTACCT 

2400 

GTTTCATTCT CAACTCTAGC CCTGCACACT 
GTTTCATTCT CAACTCTAGC CCTGCACACT 
GTTTCATTCT CAACTCTAGC CCTGCACACT 

2450 

AACAGAGCTT CCCGTGCAAA AAGGGTCACG 
AACAGAGCTT CCCGTGCAAA AAGGGTCACG 
AACAGAGCTT CCCGTGCAAA AAGGGTCACG 

2500 

CCTGCACCTC CTGTCCTCTC CCAGTTCATT 
CCTGCACCTC CTGTCCTCTC CCAGTTCATT 
CCTGCACCTC CTGTCCTCTC CCAGTTCATT 

2550 

GCAACCAGTG GCCCCCAAAG GCAGGCAGGA 
GCAACCAGTG GCCCCCAAAG GCAGGCAGGA 
GCAACCAGTG GCCCCCAAAG GCAGGCAGGA 

2600 

GAGGCTGGAA GGGCTGGCAG ATCGCTTCCC 
GAGGCTGGAA GGGCTGGCAG ATCGCTTCCC 
GAGGCTGGAA GGGCTGGCAG ATCGCTTCCC 
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2601 2650 

SPLICE FORM 2 TCATCCACCT CCACCGGTCC AGGTCTTTGC TGCTGTCCCC AGACCTCCTG 

SPLICE FORM 1 TCATCCACCT CCACCGGTCC AGGTCTTTGC TGCTGTCCCC AGACCTCCTG 

SPLICE FORM 3 TCATCCACCT CCACCGGTCC AGGTCTTTGC TGCTGTCCCC AGACCTCCTG 

2651 2700 

SPLICE FORM 2 TGACACCACG CCAGATCACA GGGCACCAGG CCAGAGATAG TCTTCTTTTT 

SPLICE FORM 1 TGACACCACG CCAGATCACA GGGCACCAGG CCAGAGATAG TCTTCTTTTT 

SPLICE FORM 3 TGACACCACG CCAGATCACA GGGCACCAGG CCAGAGATAG TCTTCTTTTT 

2701 2750 

SPLICE FORM 2 GTCCTTTCTG GCCTCTGGCT AGTCAGTTTT TCATAGCCTT ACAGTATCTG 

SPLICE FORM 1 GTCCTTTCTG GCCTCTGGCT AGTCAGTTTT TCATAGCCTT ACAGTATCTG 

SPLICE FORM 3 GTCCTTTCTG GCCTCTGGCT AGTCAGTTTT TCATAGCCTT ACAGTATCTG 

2751 2800 

SPLICE FORM 2 GCTTTGTACT GAGAAATAAA ACACATTTTC ATAAAAAAAA AAAAAAAAAA 

SPLICE FORM 1 GCTTTGTACT GAGAAATAAA ACACATTTTC ATAAAAAAAA AAAAAAAAAA 

SPLICE FORM 3 GCTTTGTACT GAGAAATAAA ACACATTTTC ATATTTGGTT AAAAAAAT^U^ 



ill 



2801 2850 
SPLICE FORM 2 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

SPLICE FORM 1 AAAAAAA - — — — ^^^^^^^^^^ — 

SPLICE FORM 3 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA--- 

2851 2894 
SPLICE FORM 2 AAAAAAAAAA AAAAAAAA7UV AAAAAAAAAA AAAAAAAAAA AAAA 

SPLICE FORM 1 — 

SPLICE FORM 3 — ^^^^^^^^^^ ^^^^ 



Si. 



MULTIPLE ALIGNMENT OF PEPTIDE SEQUENCES: 

SPLICE FORM 2 MALVTVSRSP PGSGASTPVG PWDQAVQRRS RLQRRQSFAV LRGAVLGLQD 

SPLICE FORM 1 MALVTVSRSP PGSGASTPVG PWDQAVQRRS RLQRRQSFAV LRGAVLGLQD 

SPLICE FORM 3 MALVTVSRSP PGSGASTPVG PWDQAVQRRS RLQRRQSFAV LRGAVLGLQD 

51 100 

SPLICE FORM 2 GGDNDDAAEA SSEPTEKAPS EEELHGDQTD FGQGSQSPQK QEEQRQHLHL 

SPLICE FORM 1 GGDNDDAAEA SSEPTEKAPS EEELHGDQTD FGQGSQSPQK QEEQRQHLHL 

SPLICE FORM 3 GGDNDDAAEA SSEPTEKAPS EEELHGDQTD FGQGSQSPQK QEEQRQHLHL 



101 150 

SPLICE FORM 2 MVQLLRPQDD IRLAAQLEAP RPPRLRYLLV VSTREGEGLS QDETVLLGVD 

SPLICE FORM 1 MVQLLRPQDD IRLAAQLEAP RPPRLRYLLV VSTREGEGLS QDETVLLGVD 

SPLICE FORM 3 MVQLLRPQDD IRLAAQLEAP RPPRLRYLLV VSTREGEGLS QDETVLLGVD 

151 200 

SPLICE FORM 2 FPDSSSPSCT LGLVLPLWSD TQVYLDGDGG FSVTSGGQSR IFKPISIQTM 

SPLICE FORM 1 FPDSSSPSCT LGLVLPLWSD TQVYLDGDGG FSVTSGGQSR IFKPISIQTM 

SPLICE FORM 3 FPDSSSPSCT LGLVLPLWSD TQVYLDGDGG FSVTSGGQSR IFKPISIQTM 

201 250 
SPLICE FORM 2 WATLQVLHQA CEAALGSGLV PGGSALTWAS HYQERLNSEQ SCLNEWTAMA 
SPLICE FORM 1 WATLQVLHQA CEAALGSGLV PGGSALTWAS HYQERLNSEQ SCLNEWTAMA 
SPLICE FORM 3 W 

251 300 
SPLICE FORM 2 DLESLRPPSA EPGGSSEQEQ MEQAIRAELW KVLDVSDLES VTSKEIRQAL 
SPLICE FORM 1 DLESLRPPSA EPGGSSEQEQ MEQAIRAELW KVLDVSDLES VTSKEIRQAL 
SPLICE FORM 3 SSEQEQ MEQAIRAELW KVLDVSDLES VTSKEIRQAL 



301 



350 



FIGURE 3, page 7 of 8 



SPLICE FORM 2 ELRLGLPLQQ YRDFIDNQML 
SPLICE FORM 1 ELRLGLPLQQ YRDFIDNQML 
SPLICE FORM 3 ELRLGLPLQQ YRDFIDNQML 

351 

SPLICE FORM 2 QRNRVTHILN MAREIDNFYP 
SPLICE FORM 1 QRNRVTHILN MAREIDNFYP 
SPLICE FORM 3 QRNRVTHILN MAREIDNFYP 

401 

SPLICE FORM 2 AARAQGTHVL VHCKMGVSRS 
SPLICE FORM 1 AARAQGTHVL VHCKMGVSRS 
SPLICE FORM 3 AARAQGTHVL VHCKMGVSRS 



LLVAQRDRAS RIFPHLYLGS EWNAANLEEL 
LLVAQRDRAS RIFPHLYLGS EWNAANLEEL 
LLVAQRDRAS RIFPHLYLGS EWNAANLEEL 

400 

ERFTYHNVRL WDEESAQLLP HWKETHRFIE 
ERFTYHNVRL WDEESAQLLP HWKETHRFIE 
ERFTYHNVRL WDEESAQLLP HWKETHRFIE 

450 

AATVLAYAMK QYECSLEQAL RHVQELRPIA 
AATVLAYAMK QYECSLEQAL RHVQELRPIA 
AATVLAYAMK QYECSLEQAL RHVQELRPIA 



451 500 
SPLICE FORM 2 RPNPGFLRQL QIYQGILTAS RQSHVWEQKV GGVSPEEHPA PEVSTPFPLL 

SPLICE FORM 1 RPNPGFLRQL QIYQGILTAR T — 

SPLICE FORM 3 RPNPGFLRQL QIYQGILTAR T--— 



501 550 

SPLICE FORM 2 PPEPEGGGEE KWGMEESQA APKEEPGPRP RINLRGVMRS ISLLEPSLEL 

SPLICE FORM 1 ----^-^-^^^^ ^ ^^^^ ^ 

SPLICE FORM 3 ^-^^^^ ^ ^ ^ ^^^^ 

551 600 
SPLICE FORM 2 ESTSETSDMP EVFSSHESSH EEPLQPFPQL ARTKGGQQVD RGPQPALKSR 

SPLICE FORM 1 — — ^^^^^ ^ ^ 

SPLICE FORM 3 — — 

601 650 
SPLICE FORM 2 QSVVTLQGSA WANRTQAFQ EQEQGQGQGQ GEPCISSTPR FRKWRQASV 

SPLICE FORM 1 — 

SPLICE FORM 3 - — 

651 

SPLICE FORM 2 HDSGEEGEA 

SPLICE FORM 1 

SPLICE FORM 3 — 
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